(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1350 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATG GCT GAA ATT GTT GTC TCC AAA TTT GGC GGT ACC AGC GTA GCT GAT 48
Met Ala Glu Ile Val Val Ser Lys Phe Gly Gly Thr Ser Val Ala Asp
1 5 10 15

TTT GAC GCC ATG AAC CGC AGC GCT GAT ATT GTG CTT TCT GAT GCC AAC 96
Phe Asp Ala Met Asn Arg Ser Ala Asp Ile Val Leu Ser Asp Ala Asn
20 25 30

GTG CGT TTA GTT GTC CTC TCG GCT TCT GCT GGT ATC ACT AAT CTG CTG 144
Val Arg Leu Val Val Leu Ser Ala Ser Ala Gly Ile Thr Asn Leu Leu
35 40 45

GTC GCT TTA GCT GAA GGA CTG GAA CCT GGC GAG CGA TTC GAA AAA CTC 192
Val Ala Leu Ala Glu Gly Leu Glu Pro Gly Glu Arg Phe Glu Lys Leu
50 55 60

GAC GCT ATC CGC AAC ATC CAG TTT GCC ATT CTG GAA CGT CTG CGT TAC 240
Asp Ala Ile Arg Asn Ile Gln Phe Ala Ile Leu Glu Arg Leu Arg Tyr
65 70 75 80

CCG AAC GTT ATC CGT GAA GAG ATT GAA CGT CTG CTG GAG AAC ATT ACT 288 
Pro Asn Val Ile Arg Glu Glu Ile Glu Arg Leu Leu Glu Asn Ile Thr
85 90 95 

GTT CTG GCA GAA GCG GCG GCG CTG GCA ACG TCT CCG GCG CTG ACA GAT 336 
Val Leu Ala Glu Ala Ala Ala Leu Ala Thr Ser Pro Ala Leu Thr Asp 
100 105 110 

GAG CTG GTC AGC CAC GGC GAG CTG ATG TCG ACC CTG CTG TTT GTT GAG 384 
Glu Leu Val Ser His Gly Glu Leu Met Ser Thr Leu Leu Phe Val Glu 
115 120 125 

ATC CTG CGC GAA CGC GAT GTT CAG GCA CAG TGG TTT GAT GTA CGT AAA 432
Ile Leu Arg Glu Arg Asp Val Gln Ala Gln Trp Phe Asp Val Arg Lys
130 135 140 

GTG ATG CGT ACC AAC GAC CGA TTT GGT CGT GCA GAG CCA GAT ATA GCC 480 
Val Met Arg Thr Asn Asp Arg Phe Gly Arg Ala Glu Pro Asp Ile Ala
145 150 155 160 

GCG CTG GCG GAA CTG GCC GCG CTG CAG CTG CTC CCA CGT CTC AAT GAA 528 
Ala Leu Ala Glu Leu Ala Ala Leu Gln Leu Leu Pro Arg Leu Asn Glu
165 170 175

GGC TTA GTG ATC ACC CAG GGA TTT ATC GGT AGC GAA AAT AAA GGT CGT 576 
Gly Leu Val Ile Thr Gln Gly Phe Ile Gly Ser Glu Asn Lys Gly Arg
180 185 190 

ACA ACG ACG CTT GGC CGT GGA GGC AGC GAT TAT ACG GCA GCC TTG CTG 624 
Thr Thr Thr Leu Gly Arg Gly Gly Ser Asp Tyr Thr Ala Ala Leu Leu 
195 200 205 

GCG GAG GCT TTA CAC GCA TCT CGT GTT GAT ATC TGG ACC GAC GTC CCG 672 
Ala Glu Ala Leu His Ala Ser Arg Val Asp Ile Trp Thr Asp Val Pro
210 215 220 

GGC ATC TAC ACC ACC GAT CCA CGC GTA GTT TCC GCA GCA AAA CGC ATT 720 
Gly Ile Tyr Thr Thr Asp Pro Arg Val Val Ser Ala Ala Lys Arg Ile
225 230 235 240 

GAT GAA ATC GCG TTT GCC GAA GCG GCA GAG ATG GCA ACT TTT GGT GCA 768
Asp Glu Ile Ala Phe Ala Glu Ala Ala Glu Met Ala Thr Phe Gly Ala
245 250 255

AAA GTA CTG CAT CCG GCA ACG TTG CTA CCC GCA GTA CGC AGC GAT ATC 816
Lys Val Leu His Pro Ala Thr Leu Leu Pro Ala Val Arg Ser Asp Ile
260 265 270

CCG GTC TTT GTC GGC TCC AGC AAA GAC CCA CGC GCA GGT GGT ACG CTG 864
Pro Val Phe Val Gly Ser Ser Lys Asp Pro Arg Ala Gly Gly Thr Leu
275 280 285

GTG TGC AAT AAA ACT GAA AAT CCG CCG CTG TTC CGC GCT CTG GCG CTT 912
Val Cys Asn Lys Thr Glu Asn Pro Pro Leu Phe Arg Ala Leu Ala Leu
290 295 300

CGT CGC AAT CAG ACT CTG CTC ACT TTG CAC AGC CTG AAT ATG CTG CAT 960
Arg Arg Asn Gln Thr Leu Leu Thr Leu His Ser Leu Asn Met Leu His
305 310 315 320

TCT CGC GGT TTC CTC GCG GAA GTT TTC GGC ATC CTC GCG CGG CAT AAT 1008 
Ser Arg Gly Phe Leu Ala Glu Val Phe Gly Ile Leu Ala Arg His Asn
325 330 335 

ATT TCG GTA GAC TTA ATC ACC ACG TCA GAA GTG AGC GTG GCA TTA ACC 1056 
Ile Ser Val Asp Leu Ile Thr Thr Ser Glu Val Ser Val Ala Leu Thr
340 345 350 

CTT GAT ACC ACC GGT TCA ACC TCC ACT GGC GAT ACG TTG CTG ACG CAA 1104 
Leu Asp Thr Thr Gly Ser Thr Ser Thr Gly Asp Thr Leu Leu Thr Gln
355 360 365 

TCT CTG CTG ATG GAG CTT TCC GCA CTG TGT CGG GTG GAG GTG GAA GAA 1152 
Ser Leu Leu Met Glu Leu Ser Ala Leu Cys Arg Val Glu Val Glu Glu 
370 375 380 

GGT CTG GCG CTG GTC GCG TTG ATT GGC AAT GAC CTG TCA AAA GCC TGC 1200 
Gly Leu Ala Leu Val Ala Leu Ile Gly Asn Asp Leu Ser Lys Ala Cys
385 390 395 400

GCC GTT GGC AAA GAG GTA TTC GGC GTA CTG GAA CCG TTC AAC ATT CGC 1248
Ala Val Gly Lys Glu Val Phe Gly Val Leu Glu Pro Phe Asn Ile Arg
405 410 415 

ATG ATT TGT TAT GGC GCA TCC AGC CAT AAC CTG TGC TTC CTG GTG CCC 1296 
Met Ile Cys Tyr Gly Ala Ser Ser His Asn Leu Cys Phe Leu Val Pro
420 425 430 

GGC GAA GAT GCC GAG CAG GTG GTG CAA AAA CTG CAT AGT AAT TTG TTT 1344 
Gly Glu Asp Ala Glu Gln Val Val Gln Lys Leu His Ser Asn Leu Phe
435 440 445 

GAG TAA 1350 
Glu * 
450 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GATCCATGGC TGAAATTGTT GTCTCCAAAT TTGGCG 36 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTACCGCCAA ATTTGGAGAC AACAATTTCA GCCATG 36 
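
SEQ ID NO: 2 and SEQ ID NO: 3 appear to be a complementary oligonucleotide pair. The following is a minimal sketch, not part of the original listing, that checks this under the assumption that each strand carries a 4-nucleotide 5' overhang; the helper names `reverse_complement` and `duplex_overlap` are hypothetical and introduced only for illustration.

```python
# Oligonucleotides as listed for SEQ ID NO: 2 and SEQ ID NO: 3 (spaces removed).
SEQ_ID_2 = "GATCCATGGCTGAAATTGTTGTCTCCAAATTTGGCG"
SEQ_ID_3 = "GTACCGCCAAATTTGGAGACAACAATTTCAGCCATG"

# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def duplex_overlap(top: str, bottom: str, overhang: int) -> bool:
    """Return True if `top` and `bottom` (both written 5'->3') pair exactly
    once the first `overhang` bases of each strand are left single-stranded.
    The overhang length is an assumption supplied by the caller."""
    return top[overhang:] == reverse_complement(bottom[overhang:])


if __name__ == "__main__":
    print(len(SEQ_ID_2), len(SEQ_ID_3))
    print(duplex_overlap(SEQ_ID_2, SEQ_ID_3, overhang=4))
```

With a 4-nucleotide overhang the remaining 32 bases of each strand pair exactly, which suggests the two oligonucleotides were designed to be annealed into a short adaptor. The listing itself does not state this.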

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCCGGGCCAT GGCTACAGGT TTAACAGCTA AGACCGGAGT AGAGCACT 48
(2) INFORMATION FOR SEQ ID NO: 5:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GATATCGAAT TCTCATTATA GAACTCCAGC TTTTTTC 37 
(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..911

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CC ATG GCT ACA GGT TTA ACA GCT AAG ACC GGA GTA GAG CAC TTC GGC 47
Met Ala Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly
1 5 10 15

ACC GTT GGA GTA GCA ATG GTT ACT CCA TTC ACG GAA TCC GGA GAC ATC 95 
Thr Val Gly Val Ala Met Val Thr Pro Phe Thr Glu Ser Gly Asp Ile
20 25 30 



GAT ATC GCT GCT GGC CGC GAA GTC GCG GCT TAT TTG GTT GAT AAG GGC 143 
Asp Ile Ala Ala Gly Arg Glu Val Ala Ala Tyr Leu Val Asp Lys Gly
35 40 45 



TTG GAT TCT TTG GTT CTC GCG GGC ACC ACT GGT GAA TCC CCA ACG ACA 191 
Leu Asp Ser Leu Val Leu Ala Gly Thr Thr Gly Glu Ser Pro Thr Thr 
50 55 60 

ACC GCC GCT GAA AAA CTA GAA CTG CTC AAG GCC GTT CGT GAG GAA GTT 239 
Thr Ala Ala Glu Lys Leu Glu Leu Leu Lys Ala Val Arg Glu Glu Val 
65 70 75 



GGG GAT CGG GCG AAG CTC ATC GCC GGT GTC GGA ACC AAC AAC ACG CGG 287
Gly Asp Arg Ala Lys Leu Ile Ala Gly Val Gly Thr Asn Asn Thr Arg
80 85 90 95

ACA TCT GTG GAA CTT GCG GAA GCT GCT GCT TCT GCT GGC GCA GAC GGC 335
Thr Ser Val Glu Leu Ala Glu Ala Ala Ala Ser Ala Gly Ala Asp Gly
100 105 110

CTT XTA GTT GTA ACT CCT TAT TAC TCC AAG CCG AGC CAA GAG GGA TTG 383
Leu Leu Val Val Thr Pro Tyr Tyr Ser Lys Pro Ser Gln Glu Gly Leu
115 120 125

CTG GCG CAC TTC GGT GCA ATT GCT GCA GCA AGA GAG GTT CCA ATT TGT 431 
Leu Ala His Phe Gly Ala Ile Ala Ala Ala Thr Glu Val Pro Ile Cys
130 135 140 

CTC TAT GAC ATT CCT GGT CGG TCA GGT ATT CCA ATT GAG TCT GAT ACC 479 
Leu Tyr Asp Ile Pro Gly Arg Ser Gly Ile Pro Ile Glu Ser Asp Thr
145 150 155 

ATG AGA CGC CTG AGT GAA TTA CCT ACG ATT TTG GCG GTC AAG GAC GCC 527 
Met Arg Arg Leu Ser Glu Leu Pro Thr Ile Leu Ala Val Lys Asp Ala
160 165 170 175 

AAG GGT GAC CTC GTT GCA GCC ACG TCA TTG ATC AAA GAA ACG GGA CTT 575 
Lys Gly Asp Leu Val Ala Ala Thr Ser Leu Ile Lys Glu Thr Gly Leu
180 185 190 

GCC TGG TAT TCA GGC GAT GAC CCA CTA AAC CTT GTT TGG CTT GCT TTG 623 
Ala Trp Tyr Ser Gly Asp Asp Pro Leu Asn Leu Val Trp Leu Ala Leu 
195 200 205 

GGC GGA TCA GGT TTC ATT TCC GTA ATT GGA CAT GCA GCC CCC ACA GCA 671 
Gly Gly Ser Gly Phe Ile Ser Val Ile Gly His Ala Ala Pro Thr Ala
210 215 220 

TTA CGT GAG TTG TAC ACA AGC TTC GAG GAA GGC GAC CTC GTC CGT GCG 719 
Leu Arg Glu Leu Tyr Thr Ser Phe Glu Glu Gly Asp Leu Val Arg Ala 
225 230 235 

CGG GAA ATC AAC GCC AAA CTA TCA CCG CTG GTA GCT GCC CAA GGT CGC 767 
Arg Glu Ile Asn Ala Lys Leu Ser Pro Leu Val Ala Ala Gln Gly Arg
240 245 250 255 

TTG GGT GGA GTC AGC TTG GCA AAA GCT GCT CTG CGT CTG CAG GGC ATC 815 
Leu Gly Gly Val Ser Leu Ala Lys Ala Ala Leu Arg Leu Gln Gly Ile
260 265 270 

AAC GTA GGA GAT CCT CGA CTT CCA ATT ATG GCT CCA AAT GAG CAG GAA 863 
Asn Val Gly Asp Pro Arg Leu Pro Ile Met Ala Pro Asn Glu Gln Glu
275 280 285 

CTT GAG GCT CTC CGA GAA GAC ATG AAA AAA GCT GGA GTT CTA TAA TGAGAATTC 917
Leu Glu Ala Leu Arg Glu Asp Met Lys Lys Ala Gly Val Leu *
290 295 300

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CTTCCCGTGA CCATGGGCCA TC 22 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CATGGCTGGC TTCCCCACGA GGAAGACCAA CAATGACATT ACCTCCATTG CTAGCAACGG 60 
TGGAAGAGTA CAATG 75 

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CATGCATTGT ACTCTTCCAC CGTTGCTAGC AATGGAGGTA ATGTCATTGT TGGTCTTCCT 60
CGTGGGGAAG CCAGC 75
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CATGGCTTCC TCAATGATCT CCTCCCCAGC TGTTACCACC GTCAACCGTG CCGGTGCCGG 60 
CATGGTTGCT CCATTCACCG GCCTCAAAAG 90 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CATGCTTTTG AGGCCGGTGA ATGGAGCAAC CATGCCGGCA CCGGCACGGT TGACGGTGGT 60
AACAGCTGGG GAGGAGATCA TTGAGGAAGC 90
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
CCGGTTTGCT GTAATAGGTA CCA 23 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGCTTGGTAC CTATTACAGC AAACCGGCAT G 31 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCTTCCTCAA TGATCTCCTC CCCAGCT 27 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CATTGTACTC TTCCACCGTT GCTAGCAA 28 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..20

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 70"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTGACTCGCT GCGCTCGGTC 20 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 71"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TATTTTCTCC TTACGCATCT GTGC 24 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..27

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 78"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTCATCGATA GGCGACCACA CCCGTCC 27 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..27

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 79"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
AATATCGATG CCACGATGCG TCCGGCG 27 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..55

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 81"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
CATGGAGGAG AAGATGAAGG CGATGGAAGA GAAGATGAAG GCGTGATAGG TACCG 55
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..55 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 80"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
AATTCGGTAC CTATCACGCC TTCATCTTCT CTTCCATCGC CTTCATCTTC TCCTC 55
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..14 

(D) OTHER INFORMATION: /label= name
/note= "base gene [(SSP5)2]"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala
1 5 10

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 84"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
GATGGAGGAG AAGATGAAGG C 21



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 85"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
ATCGCCTTCA TCTTCTCCTC C 21
(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 82"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
GATGGAGGAG AAGCTGAAGG C 21
(2) INFORMATION FOR SEQ ID NO: 26:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 83"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ATCGCCTTCA GCTTCTCCTC C 21 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Glu Glu Lys Leu Lys Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Glu Glu Lys Met Lys Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: C15

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..151

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.7.7.7.7.7.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46 
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 
Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 
20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAA GAG AAG ATG 142 
Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met 
35 40 45 

AAG GCG TGATAGGTAC CG 160 
Lys Ala 

50 

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys
35 40 45 

Ala 
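
The CDS feature of SEQ ID NO: 29 (positions 2..151) and the protein of SEQ ID NO: 30 can be cross-checked by translation. The sketch below is illustrative only and not part of the original listing. It assumes the standard genetic code and rebuilds the 160-nucleotide sequence from its repeated 21-nucleotide units; the names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# SEQ ID NO: 29 assembled from its repeat units as they appear in the listing.
UNIT_5 = "ATGGAGGAGAAGATGAAGGCG"       # encodes Met Glu Glu Lys Met Lys Ala
UNIT_7 = "ATGGAGGAGAAGCTGAAGGCG"       # encodes Met Glu Glu Lys Leu Lys Ala
UNIT_5_LAST = "ATGGAAGAGAAGATGAAGGCG"  # final unit uses GAA for the first Glu
SEQ_ID_29 = "C" + UNIT_5 + UNIT_7 * 5 + UNIT_5_LAST + "TGATAGGTACCG"


def translate_cds(seq: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive range start..end of `seq`.
    Stop codons are returned as '*'."""
    cds = seq[start - 1:end]
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


protein = translate_cds(SEQ_ID_29, 2, 151)

if __name__ == "__main__":
    print(len(SEQ_ID_29))
    print(protein)
```

Under these assumptions the translation gives 49 residues followed by a stop codon, in agreement with the 49-amino-acid length stated for SEQ ID NO: 30 and with the "5.7.7.7.7.7.5" unit order given in the feature table.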

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: C20 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..151

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.7.7.7.7.7.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met
1 5 10 15

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94
Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu
20 25 30

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAA GAG AAG ATG 142
Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met
35 40 45

AAG GCG TGATAGGTAC CG 160
Lys Ala 

50 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 

Ala 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: E. coli

(G) CELL TYPE: DH5 alpha

(vii) IMMEDIATE SOURCE:
(B) CLONE: C30

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..130

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.7.7.7.7.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46 
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 
Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 
20 25 30 

AAG CTG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC CG 139 
Lys Leu Lys Ala Met Glu Glu Lys Met Lys Ala 
35 40
(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Met Lys Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: D16 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.5.5.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met
1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 

CG 97
(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: E. coli

(G) CELL TYPE: DH5 alpha

(vii) IMMEDIATE SOURCE:
(B) CLONE: D20

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..109

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.5.5.5.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met
1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG GAA GAG 94 
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu Glu 
20 25 30 

AAG ATG AAG GCG TGATAGGTAC CG 118 
Lys Met Lys Ala 
35
(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys 
20 25 30 

Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: D33 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.5.5.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met
1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala
20 25

CG 97
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15 

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 86"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GATGGAGGAG AAGCTGAAGA A 21 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 87"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:



ATCTTCTTCA GCTTCTCCTC C 21 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 88"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GATGGAGGAG AAGCTGAAGT G 21 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 89"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
ATCCACTTCA GCTTCTCCTC C 21 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 90"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GATGGAGGAG AAGATGAAGA A 21 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 91"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
ATCTTCTTCA TCTTCTCCTC C 21 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 92"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GATGGAGGAG AAGATGAAGT G 21
(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 93"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
ATCCACTTCA TCTTCTCCTC C 21 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

Met Glu Glu Lys Leu Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Glu Glu Lys Leu Lys Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Glu Glu Lys Met Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Glu Glu Lys Met Lys Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 82-4 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..151

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "7.7.7.7.7.7.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

C ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46
Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met
1 5 10 15

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94
Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu
20 25 30



AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAA GAG AAG ATG 142 
Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met 
35 40 45 

AAG GCG TGATAGGTAC CG 160 
Lys Ala 

50 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 

Ala 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 84-H3 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.5.5.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met
1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala
20 25

CG 97

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli

(G) CELL TYPE: DH5 alpha 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: 86-H23 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.8.8.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG AAG ATG 46 
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Lys Met 
1 5 10 15 

GAG GAG AAG CTG AAG AAG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 
Glu Glu Lys Leu Lys Lys Met Glu Glu Lys Met Lys Ala 
20 25 

CG 97 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Lys Met Glu
1 5 10 15

Glu Lys Leu Lys Lys Met Glu Glu Lys Met Lys Ala
20 25

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 88-2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..103

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.9.9.9.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

C ATG GAG GAG AAG ATG AAG GCG AAG AAG CTG AAG TGG ATG GAG GAG 46
Met Glu Glu Lys Met Lys Ala Lys Lys Leu Lys Trp Met Glu Glu
1 5 10 15

AAG CTG AAG TGG ATG GAG GAG AAG CTG AAG TGG ATG GAA GAG AAG ATG 94 
Lys Leu Lys Trp Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met 
20 25 30 

AAG GCG TGATAGGTAC CG 112 
Lys Ala 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Met Glu Glu Lys Met Lys Ala Lys Lys Leu Lys Trp Met Glu Glu Lys 
1 5 10 15

Leu Lys Trp Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met Lys 
20 25 30 

Ala 

(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 90-H8 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..109

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.10.10.10.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG AAG ATG 46 
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Lys Met 
1 5 10 15 

GAG GAG AAG ATG AAG AAG ATG GAG GAG AAG ATG AAG AAG ATG GAA GAG 94 
Glu Glu Lys Met Lys Lys Met Glu Glu Lys Met Lys Lys Met Glu Glu 
20 25 30 

AAG ATG AAG GCG TGATAGGTAC CG 118 
Lys Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Lys Met Glu 
1 5 10 15

Glu Lys Met Lys Lys Met Glu Glu Lys Met Lys Lys Met Glu Glu Lys 
20 25 30

Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 92-2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.11.11.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG TGG ATG 46
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Trp Met
1 5 10 15

GAG GAG AAG ATG AAG TGG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 
Glu Glu Lys Met Lys Trp Met Glu Glu Lys Met Lys Ala 
20 25 

CG 97 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Trp Met Glu 
1 5 10 15

Glu Lys Met Lys Trp Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..84

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 96"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GATGGAGGAA AAGATGAAGG CGATGGAGGA GAAAATGAAA GCTATGGAGG AAAAGATGAA 60 
AGCGATGGAG GAGAAAATGA AGGC 84
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..84

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 97"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
ATCGCCTTCA TTTTCTCCTC CATCGCTTTC ATCTTTTCCT CCATAGCTTT CATTTTCTCC 60 
TCCATCGCCT TCATCTTTTC CTCC 84 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein

(B) LOCATION: 1..28

(D) OTHER INFORMATION: /label= name

/note= "(SSP 5)4"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic)



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..84

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 98"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GATGGAGGAA AAGCTGAAAG CGATGGAGGA GAAACTCAAG GCTATGGAAG AAAAGCTTAA 60 
AGCGATGGAG GAGAAACTGA AGGC 84 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..84

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 99"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
ATCGCCTTCA GTTTCTCCTC CTACGCTTTA AGCTTTTCTT CCATAGCCTT GAGTTTCTCC 60 
TCCATCGCTT TCAGCTTTTC CTCC 84 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /label= name

/note= "(SSP 7)4"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala 
20 25

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..84

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 100"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
GATGGAGGAA AAGCTTAAGA AGATGGAAGA AAAGCTGAAA TGGATGGAGG AGAAACTCAA 60 
AAAGATGGAG GAAAAGCTTA AATG 84 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..84

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 101"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

ATCCATTTAA GCTTTTCCTC CTACTTTTTG AGTTTCTCCT CCATCCATTT CAGCTTTTCT 60 

TCCATCTTCT TAAGCTTTTC CTCC 84 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Met Glu Glu Lys Leu Lys Lys Met Glu Glu Lys Leu Lys Trp Met Glu 
1 5 10 15 



Glu Lys Leu Lys Lys Met Glu Glu Lys Leu Lys Trp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: 2-9 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..235

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "7.7.7.7.7.7.8.9.8.9.5"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

C ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 
Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 
Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 
20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAA AAG CTT 
Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu 
35 40 45 

AAG AAG ATG GAA GAA AAG CTG AAA TGG ATG GAG GAG AAA CTC AAA AAG 
Lys Lys Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Leu Lys Lys 
50 55 60 
ATG GAG GAA AAG CTT AAA TGG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 242
Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met Lys Ala 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys 
35 40 45 

Lys Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Leu Lys Lys Met 
50 55 60 

Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met Lys Ala 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 5-1 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..172

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "5.5.5.7.7.7.7.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAG GAA AAG CTG AAA GCG ATG GAG GAG 94
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu
20 25 30

AAA CTC AAG GCT ATG GAA GAA AAG CTT AAA GCG ATG GAG GAG AAA CTG 142
Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu
35 40 45

AAG GCC ATG GAA GAG AAG ATG AAG GCG TGATAG 175
Lys Ala Met Glu Glu Lys Met Lys Ala
50 55

(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu
1 5 10 15

Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys
20 25 30

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys
35 40 45

Ala Met Glu Glu Lys Met Lys Ala
50 55

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..173

(D) OTHER INFORMATION: /function= "synthetic storage protein"
/product= "protein"
/gene= "ssp"
/standard_name= "SSP-3-5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

CC ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 47 
Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 95 
Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 
20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAA AAG ATG 143
Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met
35 40 45

AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC CGAATTC 187
Lys Ala Met Glu Glu Lys Met Lys Ala
50 55

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 

Ala Met Glu Glu Lys Met Lys Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..61

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 107"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
CATGGAGGAG AAGATGAAAA AGCTCGAAGA GAAGATGAAG GTCATGAAGT GATAGGTACC 60 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..61

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 106"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
AATTCGGTAC CTATCACTTC ATGACCTTCA TCTTCTCTTC GAGCTTTTTC ATCTTCTCCT 60 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..16 
(D) OTHER INFORMATION: /label= name
/note= "pSK34 base gene"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:



Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys Val Met Lys 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 110"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GCTGGAAGAA AAGATGAAGG CTATGGAGGA CAAGATGAAA TGGCTTGAGG AAAAGATGAA 60 

GAA 63 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..63

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 111"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
AGCTTCTTCA TCTTTTCCTC AAGCCATTTC ATCTTGTCCT CCATAGCCTT CATCTTTTCT 60 
TCC 63 
(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Asp Lys Met Lys Trp Leu Glu Glu Lys Met Lys Lys Leu Glu Glu Lys 
20 25 30 

Met Lys Val Met Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Asp Lys Met Lys Trp Leu Glu Glu Lys Met Lys Lys Leu Glu Glu Lys 
20 25 30 

Met Lys Val Met Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..62

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 112"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:
GCTCGAAGAA AGATGAAGGC AATGGAAGAC AAAATGAAGT GGCTTGAGGA GAAAATGAAG 60 
AA 62 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..62

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 113"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
AGCTTCTTCA TTTTCTCCTC AAGCCACTTC ATTTTGTCTT CCATTGCCTT CATCTTTCTT 60 
CG 62 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Met Glu Glu Lys Met Lys Lys Leu Lys Glu Glu Met Ala Lys Met Lys 
1 5 10 15 

Asp Glu Met Trp Lys Leu Lys Glu Glu Met Lys Lys Leu Glu Glu Lys 
20 25 30 

Met Lys Val Met Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..63

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 114"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
GCTCAAGGAG GAAATGGCTA AGATGAAAGA CGAAATCTGG AAACTGAAAG AGGAAATGAA 60 
GAA 63 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

(D) OTHER INFORMATION: /product- "synthetic 

oligonucleotide" 
/standard_name- "SM 
115" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
AGCTTCTTCA TTTCCTCTTT CAGTTTCCAC ATTTCGTCTT TCATCTTAGC CATTTCCTCC 60 
TTG 63 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Met Glu Glu Lys Met Lys Lys Leu Lys Glu Glu Met Ala Lys Met Lys 
1 5 10 15 

Asp Glu Met Trp Lys Leu Lys Glu Glu Met Lys Lys Leu Glu Glu Lys 
20 25 30 

Met Lys Val Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys 
35 40 45

Ala Met Glu Asp Lys Met Lys Trp Leu Glu Glu Lys Met Lys Lys Leu 
50 55 60 

Glu Glu Lys Met Lys Val Met Glu Glu Lys Met Lys Lys Leu Glu Glu 
65 70 75 80 

Lys Met Lys Ala Met Glu Asp Lys Met Lys Trp Leu Glu Glu Lys Met
85 90 95 

Lys Lys Leu Glu Glu Lys Met Lys Val Met Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

GGATCCCCCG GGCTGCAGGA ATTCTACGTA CCATATAGTA AGACTTTGTA TATAAGACGT 60 

CACCTCTTAC GTGCATGGTT ATATGTGACA TGTGCAGTGA CGTTGTACCA TATAGTAAGA 120 

CTTTGTATAT AAGACGTCAC CTCTTACGTG CATGGTTATA TGTGACATGT GCAGTGACGT 180 

TAACCGCACC CTCCTTCCCG TCGTTTCCCA TCTCTTCCTC CTTTAGAGCT ACCACTATAT 240 

AAATCAGGGC TCATTTTCTC GCTCCTCACA GGCTCATCAG CACCCCGGCA GTGCCACCCC 300 

GACTCCCTGC ACCTGCCATG GGTACGCTAG CCCGGGAGAT CTGACAAAGC AGCATTAGTC 360 

CGTTGATCGG TGGAAGACCA CTCGTCAGTG TTGAGTTGAA TGTTTGATCA ATAAAATACG 420 

GCAATGCTGT AAGGGTTGTT TTTTATGCCA TTGATAATAC ACTGTACTGT TCAGTTGTTG 480 

AACTCTATTT CTTAGCCATG CCAGTGCTTT TCTTATTTTG AATAACATTA CAGCAAAAAG 540 

TTGAAAGACA AAAAAANNNN NCCCCGAACA GAGTGCTTTG GGTCCCAAGC TTCTTTAGAC 600 

TGTGTTCGGC GTTCCCCCTA AATTTCTCCC CTATATCTCA CTCACTTGTC ACATCAGCGT 660 

TCTCTTTCCC CTATATCTCC ACGCTCTACA GCAGTTCCAC CTATATCAAA CCTCTATACC 720 

CCACCACAAC AATATTATAT ACTTTCATCT TCACCTAACT CATGTACCTT CCAATTTTTT 780 

TCTACTAATA ATTATTTACG TGCACAGAAA CTTAGGCAAG GGAGAGAGAG AGCGGTACC 839 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 




(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

CTAGAAGCCT CGGCAACGTC AGCAACGGCG GAAGAATCCG GTG 43 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

CATGCACCGG ATTCTTCCGC CGTTGCTGAC GTTGCCGAGG CTT 43 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GATCCCATGG CGCCCCTTAA GTCCACCGCC AGCCTCCCCG TCGCCCGCCG CTCCT 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CTAGAGGAGC GGCGGGCGAC GGGGAGGCTG GCGGTGGACT TAAGGGGCGC CATGG 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CATGGCGCCC ACCGTGATGA TGGCCTCGTC GGCCACCGCC GTCGCTCCGT TCCAGGGGC 59

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

TTAAGCCCCT GGAACGGAGC GACGGCGGTG GCCGACGAGG CCATCATCAC GGTGGGCGC 59

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

GCGCCCACCG TGATGA 16 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CACCGGATTC TTCCGC 16 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

GTAAGATTGG TAAAGTCCAG CAAGAAAATG AGATAAAAGA GAAGCCTGAA ATGACGAAAA 60
AATCAGGTGT TTTGATTCTT GGTGCTGGAC GTGTGTNTCG CCCAGCTGCT GATTTCCTAG 120
CTTCAGTTAG AACCATTTCG TCACAGCAAT GGTACAAAAC ATATTTCGGA GCAGACTCTG 180
AAGAGAAAAC AGATGTTCAT GTGATTGTCG CGTCTCTGTA TCTTAAGGAT GCCAAAGAGA 240
CGGTTGAAGG TATTTCAGAT GTAGAAGCAG TTCGGCTAGA TGTATCTGAT AGTGAAAGTC 300 
TCCTTAAGTA TGTTTCTCAG GTTGATGTTG TCCTAAGTTT ATTACCTGCA AGTTGTCATG 360 
CTTGTTGTAG CA 372 
(2) INFORMATION FOR SEQ ID NO: 103: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

GGAAGCACAC TGCGACTCTT TTGGAATTCG GGGACATCAA GAATGGACAA ACAACAACCG 60 

CTATGGCCAA GACTGTTGGG ATCCCTGCAG CCATTGGAGC TCTGCTGTTA ATTGAAGACA 120 

AGATCAAGAC AAGAGGAGTC TTAAGGCCTC TCGAAGCAGA GGTGTATTTG CCAGCTTTGG 180

ATATATTGCA AGCATATGGT ATAAAGCTGA TGGAGAAGGC AGAATGATCA AAGAACTCTG 240

TATATTGTTT CTNTCTATAA CTTGGAGTTG GAGACAAAGC TGAAGGAGNC AGNGCCATTA 300

GACCAGCAAA AAAAGGAGGA GGA 323

(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

Lys Ile Gly Lys Val Gln Gln Glu Asn Glu Ile Lys Glu Lys Pro Glu
1 5 10 15

Met Thr Lys Lys Ser Gly Val Leu Ile Leu Gly Ala Gly Arg Val Xaa
20 25 30

Arg Pro Ala Ala Asp Phe Leu Ala Ser Val Arg Thr Ile Ser Ser Gln
35 40 45

Gln Trp Tyr Lys Thr Tyr Phe Gly Ala Asp Ser Glu Glu Lys Thr Asp
50 55 60

Val His Val Ile Val Ala Ser Leu Tyr Leu Lys Asp Ala Lys Glu Thr
65 70 75 80

Val Glu Gly Ile Ser Asp Val Glu Ala Val Arg Leu Asp Val Ser Asp
85 90 95

Ser Glu Ser Leu Leu Lys Tyr Val Ser Gln Val Asp Val Val Leu Ser
100 105 110

Leu Leu Pro Ala Ser Cys His Ala Cys Cys Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

Lys His Thr Ala Thr Leu Leu Glu Phe Gly Asp Ile Lys Asn Gly Gln
1 5 10 15

Thr Thr Thr Ala Met Ala Lys Thr Val Gly Ile Pro Ala Ala Ile Gly
20 25 30

Ala Leu Leu Leu Ile Glu Asp Lys Ile Lys Thr Arg Gly Val Leu Arg
35 40 45

Pro Leu Glu Ala Glu Val Tyr Leu Pro Ala Leu Asp Ile Leu Gln Ala
50 55 60

Tyr Gly Ile Lys Leu Met Glu Lys Ala Glu
65 70 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:



ATTCCCCATG GTTTCGCCGA CGAAT 



(2) INFORMATION FOR SEQ ID NO: 107: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:
CTCTCGGTAC CTAGTACCTA CTGATCAAC 